AITopics | Graphics

GPU-Accelerated Primal Learning for Extremely Fast Large-Scale Classification: Supplementary Material

Neural Information Processing SystemsMay-29-2025, 03:32:58 GMT

The following summarizes the major operations of the GPU-optimized TRON logistic regression solver, TRON-LR-GPU, as described in the main paper and herein. For each set of operations, the original lines from Algorithm 1 being optimized are listed in red.

artificial intelligence, hessian-vector product, machine learning, (12 more...)

Neural Information Processing Systems

Country:

North America > United States > California (0.16)
North America > Canada (0.14)

Genre:

Research Report > New Finding (0.37)
Research Report > Experimental Study (0.37)

Technology:

Information Technology > Hardware (0.95)
Information Technology > Graphics (0.85)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.54)

Add feedback

HumanVid: Demystifying Training Data for Camera-controllable Human Image Animation

Neural Information Processing SystemsMay-28-2025, 18:58:44 GMT

Human image animation involves generating videos from a character photo, allowing user control and unlocking the potential for video and movie production. While recent approaches yield impressive results using high-quality training data, the inaccessibility of these datasets hampers fair and transparent benchmarking. Moreover, these approaches prioritize 2D human motion and overlook the significance of camera motions in videos, leading to limited control and unstable video generation. To demystify the training data, we present HumanVid, the first large-scale high-quality dataset tailored for human image animation, which combines crafted real-world and synthetic data. For the real-world data, we compile a vast collection of real-world videos from the internet.

artificial intelligence, machine learning, video, (15 more...)

Neural Information Processing Systems

Country:

Europe (0.28)
Asia > China (0.14)
North America > United States (0.14)

Genre: Research Report > New Finding (1.00)

Industry:

Media > Television (0.52)
Media > Photography (0.52)
Media > Film (0.52)

Technology:

Information Technology > Graphics (1.00)
Information Technology > Communications (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.68)

Add feedback

1909ac72220bf5016b6c93f08b66cf36-Paper-Datasets_and_Benchmarks.pdf

Neural Information Processing SystemsMay-28-2025, 17:37:10 GMT

artificial intelligence, machine learning, nighttime, (16 more...)

Neural Information Processing Systems

Genre: Research Report (0.46)

Technology:

Information Technology > Sensing and Signal Processing > Image Processing (1.00)
Information Technology > Graphics (0.68)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.46)

Add feedback

185fdf627eaae2abab36205dcd19b817-Supplemental-Datasets_and_Benchmarks.pdf

Neural Information Processing SystemsMay-28-2025, 17:17:24 GMT

artificial intelligence, dataset, machine learning, (15 more...)

Neural Information Processing Systems

Country: Europe (0.14)

Industry: Transportation > Ground > Road (1.00)

Technology:

Information Technology > Graphics (1.00)
Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.48)

Add feedback

2082273791021571c410f41d565d0b45-Supplemental-Conference.pdf

Neural Information Processing SystemsMay-28-2025, 15:12:51 GMT

Privacy Assessment on Reconstructed Images: Are Existing Evaluation Metrics Faithful to Human Perception? In Section 4.1, we briefly introduced how humans annotate the reconstructed images for different datasets. In the supplementary material, we have included a graphical user interface (GUI) that was utilized by the annotators. Figure 1 displays the GUI, where (A) and (B) were specifically designed for annotating different datasets. To minimize the influence of subjective bias, we use a relatively objective formulation: whether the reconstructed image can be correctly labeled.

artificial intelligence, machine learning, similarity, (14 more...)

Neural Information Processing Systems

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Graphics (0.76)
Information Technology > Sensing and Signal Processing > Image Processing (0.71)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.70)

Add feedback

DMesh: A Differentiable Mesh Representation Yang Zhou 2

Neural Information Processing SystemsMay-28-2025, 13:56:23 GMT

We present a differentiable representation, DMesh, for general 3D triangular meshes. DMesh considers both the geometry and connectivity information of a mesh. In our design, we first get a set of convex tetrahedra that compactly tessellates the domain based on Weighted Delaunay Triangulation (WDT), and select triangular faces on the tetrahedra to define the final mesh. We formulate probability of faces to exist on the actual surface in a differentiable manner based on the WDT. This enables DMesh to represent meshes of various topology in a differentiable way, and allows us to reconstruct the mesh under various observations, such as point clouds and multi-view images using gradient-based optimization.

artificial intelligence, machine learning, mesh, (17 more...)

Neural Information Processing Systems

Country: North America > United States > Maryland (0.14)

Genre:

Research Report > New Finding (1.00)
Research Report > Experimental Study (1.00)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Graphics (0.67)

Add feedback

Omnigrasp: Grasping Diverse Objects with Simulated Humanoids Zhengyi Luo 1,2 Sammy Christen 2,3 Alexander Winkler 2

Neural Information Processing SystemsMay-28-2025, 07:32:02 GMT

We present a method for controlling a simulated humanoid to grasp an object and move it to follow an object's trajectory. Due to the challenges in controlling a humanoid with dexterous hands, prior methods often use a disembodied hand and only consider vertical lifts or short trajectories. This limited scope hampers their applicability for object manipulation required for animation and simulation. To close this gap, we learn a controller that can pick up a large number (>1200) of objects and carry them to follow randomly generated trajectories. Our key insight is to leverage a humanoid motion representation that provides human-like motor skills and significantly speeds up training.

machine learning, natural language, trajectory, (16 more...)

Neural Information Processing Systems

Genre: Research Report > Experimental Study (0.93)

Industry: Information Technology (0.46)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
(3 more...)

Add feedback

Learning to See Physics via Visual De-animation

Jiajun Wu, Erika Lu, Pushmeet Kohli, Bill Freeman, Josh Tenenbaum

Neural Information Processing SystemsMay-28-2025, 01:05:34 GMT

We introduce a paradigm for understanding physical scenes without human annotations. At the core of our system is a physical world representation that is first recovered by a perception module and then utilized by physics and graphics engines. During training, the perception module and the generative models learn by visual de-animation -- interpreting and reconstructing the visual information stream. During testing, the system first recovers the physical world state, and then uses the generative models for reasoning and future prediction. Even more so than forward simulation, inverting a physics or graphics engine is a computationally hard problem; we overcome this challenge by using a convolutional inversion network. Our system quickly recognizes the physical world state from appearance and motion cues, and has the flexibility to incorporate both differentiable and non-differentiable physics and graphics engines. We evaluate our system on both synthetic and real datasets involving multiple physical scenes, and demonstrate that our system performs well on both physical state estimation and reasoning problems. We further show that the knowledge learned on the synthetic dataset generalizes to constrained real images.

engine, machine learning, natural language, (18 more...)

Neural Information Processing Systems

Country: North America > United States (0.14)

Technology:

Information Technology > Graphics (1.00)
Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.93)
(2 more...)

Add feedback

RenderNet: A deep convolutional network for differentiable rendering from 3D shapes

Thu H. Nguyen-Phuoc, Chuan Li, Stephen Balaban, Yongliang Yang

Neural Information Processing SystemsMay-26-2025, 07:07:02 GMT

Traditional computer graphics rendering pipelines are designed for procedurally generating 2D images from 3D shapes with high performance. The nondifferentiability due to discrete operations (such as visibility computation) makes it hard to explicitly correlate rendering parameters and the resulting image, posing a significant challenge for inverse rendering tasks. Recent work on differentiable rendering achieves differentiability either by designing surrogate gradients for non-differentiable operations or via an approximate but differentiable renderer. These methods, however, are still limited when it comes to handling occlusion, and restricted to particular rendering effects. We present RenderNet, a differentiable rendering convolutional network with a novel projection unit that can render 2D images from 3D shapes. Spatial occlusion and shading calculation are automatically encoded in the network. Our experiments show that RenderNet can successfully learn to implement different shaders, and can be used in inverse rendering tasks to estimate shape, pose, lighting and texture from a single image.

artificial intelligence, machine learning, rendernet, (16 more...)

Neural Information Processing Systems

Country:

North America > Canada (0.14)
Europe (0.14)

Genre: Research Report > New Finding (0.68)

Technology:

Information Technology > Sensing and Signal Processing > Image Processing (1.00)
Information Technology > Graphics (1.00)
Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

Add feedback

Neural Gaffer: Relighting Any Object via Diffusion Yuan Li2

Neural Information Processing SystemsMay-25-2025, 21:46:49 GMT

Single-image relighting is a challenging task that involves reasoning about the complex interplay between geometry, materials, and lighting. Many prior methods either support only specific categories of images, such as portraits, or require special capture conditions, like using a flashlight. Alternatively, some methods explicitly decompose a scene into intrinsic components, such as normals and BRDFs, which can be inaccurate or under-expressive.

diffusion model, machine learning, natural language, (15 more...)

Neural Information Processing Systems

Country: North America > United States > California > San Francisco County > San Francisco (0.14)

Genre: Research Report > Experimental Study (0.93)

Industry: Media (0.46)

Technology: